# Instruction Fine-tuning Optimization
Hyperclovax SEED Text Instruct 0.5B GGUF
Other
A 0.5B parameter-scale text generation model based on llama.cpp, supporting instruction-based text generation tasks
Large Language Model
H
Mungert
407
1
Llama 3.1 MIG Tulu 3 8B SFT
Apache-2.0
Llama-3.1-8B model fine-tuned using the automatically filtered 50,000-entry Tulu-3-MIG dataset
Large Language Model
Transformers

L
xsample
26
3
Captain Eris Violet V0.420 12B
Other
Captain Violet is a 12B-parameter merged model, created by combining Epiculous/Violet_Twilight-v0.2 and Nitral-AI/Captain_BMO-12B using the mergekit tool, supporting text generation tasks.
Large Language Model
Transformers English

C
Nitral-AI
445.12k
41
Mini Ichigo Llama3.2 3B S Instruct
Apache-2.0
The Ichigo-llama3s series model is a multimodal language model developed by Homebrew Research, natively supporting audio and text input comprehension. Based on the Llama-3 architecture, it is trained using WhisperVQ as an audio file tokenizer, enhancing its audio understanding capabilities.
Text-to-Audio English
M
Menlo
22
34
Tarsier 34b
Apache-2.0
Tarsier-34b is an open-source large-scale video-language model focused on generating high-quality video captions and achieving leading results in multiple public benchmarks.
Video-to-Text
Transformers

T
omni-research
103
17
Mistral 7b V0.3 Summarizer
Mistral-7B-Instruct-v0.3 is an instruction-tuned version of Mistral-7B, focusing on text generation tasks that follow human instructions.
Large Language Model
Transformers English

M
devesh-2002
22
0
Llm Jp 13b V2.0
Apache-2.0
A large-scale language model developed by the Japanese collaborative project LLM-jp, supporting Japanese and English, primarily used for text generation tasks.
Large Language Model
Transformers Supports Multiple Languages

L
llm-jp
570
14
Wizardlaker 7B
Apache-2.0
Wizard Lake 7B is a fusion model combining the next-generation WizardLM 2 7B model with a customized DolphinLake model, delivering outstanding performance.
Large Language Model
Transformers

W
Noodlz
22
2
Idefics2 8b
Apache-2.0
Idefics2 is an open-source multimodal model capable of accepting arbitrary sequences of image and text inputs to generate text outputs. It shows significant improvements in OCR, document understanding, and visual reasoning.
Image-to-Text
Transformers English

I
HuggingFaceM4
14.99k
603
Mistral 7B OpenOrca Q4 K M GGUF
Apache-2.0
This model is a GGUF format model converted from Open-Orca/Mistral-7B-OpenOrca, suitable for text generation tasks.
Large Language Model English
M
munish0838
81
2
Codellama 34b Instruct Hf
Code Llama is a series of code generation and understanding models developed by Meta, ranging from 7 billion to 34 billion parameters. This model is the 34 billion parameter instruction fine-tuned version.
Large Language Model
Transformers Other

C
meta-llama
1,756
17
Tinymistral 6x248M Instruct
Apache-2.0
A language model fine-tuned based on the Mixture of Experts (MoE) architecture, which fuses multiple models through the LazyMergekit framework and performs excellently in instruction tasks.
Large Language Model
Transformers English

T
M4-ai
1,932
9
Mistral 7B Instruct V0.2 Sparsity 30 V0.1
Apache-2.0
Mistral-7B-Instruct-v0.2 is an enhanced instruction fine-tuned large language model based on Mistral-7B-Instruct-v0.1, achieving 30% sparsity through Wanda pruning method without requiring retraining while maintaining competitive performance.
Large Language Model
Transformers

M
wang7776
75
1
Mindllm 1b3 Chat Zh V2.0
Apache-2.0
MindLLM 1.3B is a 1.3 billion-parameter Transformer model jointly developed by the Beijing Engineering Research Center of Massive Language Information Processing and Cloud Computing Applications and the Southeast Institute of Information Technology, Beijing Institute of Technology, supporting Chinese and English dialogue generation.
Large Language Model
Transformers Supports Multiple Languages

M
bit-dny
122
15
Phind CodeLlama 34B V2
Phind-CodeLlama-34B-v2 is an open-source code generation model fine-tuned from Phind-CodeLlama-34B-v1, achieving 73.8% pass@1 on HumanEval tests, representing the state-of-the-art among open-source models.
Large Language Model
Transformers

P
Phind
34.68k
832
Llama 2 7B 32K
An open-source long-context language model fine-tuned based on Meta's original Llama-2 7B model, supporting 32K context length
Large Language Model
Transformers English

L
togethercomputer
5,411
538
Chinese Llama 2 7b
Openrail
Fully open-source and commercially usable Chinese version of the Llama2 model with Chinese-English supervised fine-tuning dataset. The input format strictly adheres to the llama-2-chat standard and is fully compatible with all optimization solutions for the original llama-2-chat model.
Large Language Model
Transformers Supports Multiple Languages

C
LinkSoul
534
317
Codet5p 2b
Bsd-3-clause
CodeT5+ is an open-source family of large language models for code, supporting code understanding and generation tasks, featuring an encoder-decoder architecture with flexible mode switching.
Large Language Model
Transformers

C
Salesforce
745
35
Pythia Chat Base 7B
Apache-2.0
A 7-billion-parameter open-source dialogue model fine-tuned from EleutherAI Pythia-7B, trained on over 40 million instructions using 100% carbon-negative computing resources
Large Language Model
Transformers English

P
togethercomputer
194
68
Featured Recommended AI Models